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ABSTRACT 

^h ; 

^ ! We discuss the tests of nonGaussianity in CMB maps using morphological 

statistics known as Minkowski functionals. As an example we test degree-scale 
cosmic microwave background (CMB) anisotropy for Gaussianity by studying the 
QMASK map that was obtained from combining the QMAP and Saskatoon data. 
We compute seven morphological functions Mj(AT), % = 1, ..., 7: six Minkowski 
functionals and the number of regions N c at a hundred AT levels. We also 
introduce a new parameterization of the morphological functions M{(A) in terms 
P-T of the total area A of the excursion set. We show that the latter considerably 

decorrelates the morphological statistics and makes them more robust because 
they are less sensitive to the measurements at extreme levels. We compare these 
results with those from 1000 Gaussian Monte Carlo maps with the same sky 

,£h ! coverage, noise properties and power spectrum, and conclude that the QMASK 

map is neither a very typical nor a very exceptional realization of a Gaussian 
field. At least about 20% of the 1000 Gaussian Monte Carlo maps differ more 
than the QMASK map from the mean morphological parameters of the Gaussian 
fields. 

Subject headings: cosmology:theory — cosmic microwave background: anisotropy 



1. Introduction 

The issue of Gaussianity of CMB maps plays a crucial role in testing assumptions about 
the early Universe. The simplest inflation models strongly favor Gaussianity of the primordial 
inhomogeneities (see Turner 1997, and references therein), whereas other scenarios assuming 
cosmic strings or topological defects (see Vilenkin and Shellard 1994, and references therein) 
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predict non-Gaussian perturbations. Gaussianity is also a key underlying assumption of 
all experimental power spectrum analyses to date, entering into the computation of error 
bars (Tegmark 1997; Bond and Jaffe 1998), and therefore needs to be observationally 
tested. A third reason for studying Gaussianity of CMB maps is that it may reveal otherwise 
undetected foreground contamination. 

Numerous tests of Gaussianity in the COBE maps (Colley, Gott, and Park 1996; Kogut 
et al. 1996; Ferreira, Magueijo, and Gorski 1998; Pando, Valls-Gabaud, and Fang 1998; 
Bromley and Tegmark 1999; Novikov, Feldman, and Shandarin 1999; Banday, Zaroubi, and 
Gorski 2000; Barreiro et al. 2000; Magueijo 2000; Mukherjee Hobson; Aghanim, Forni, 
and Bouchet 2001; Phillips and Kogut 2001) have resulted in the general agreement that 
all non-Gaussian signals were of noncosmological origin. This was not unexpected because 
of COBE's low (7° deg) angular resolution. 

Testing Gaussianity on smaller scales may bring much more interesting results. The 
first study of Gaussianity on degree scale showed the consistency of the QMASK map with 
the assumption of the Gaussianity (Park et al. 2001). However, this study tested only 
the Gaussianity of the topology of the map. The study of the MAXIMA-1 anisotropy data 
also showed the consistency with the Gaussianity in the range between 10 arc-minutes and 
5 degrees (Wu et al. 2001). A total of 82 tests for Gaussianity were made in this study. 
Although performing many tests is always better than few it is not clear how independent 
these tests were. 

The question of statistical independence of different non- Gaussianity tests is compli- 
cated. However, in some simple cases some of the tests can be statistically disentangled. 
For instance, consider the probability function, which is perhaps the simplest test for non- 
Gaussianity. It easy to transform the probability function to any given form simply by 
relabeling the contours using a monotonic function. This relabeling obviously does not have 
any effect on the morphology of the field since every contour line remains the same as well as 
the order of levels due to monotonicity of the transformation. However, this transformation 
may strongly affect the whole hierarchy of the n-point correlation functions. This is of course 
due to well known connection of the probability function to all n-point functions. 

Genus represents only one statistic sensitive to non-Gaussianity. It may detect some 
types of non-Gaussianity and be not sensitive to the others. Consider, for instance, the 
following mapping of a Gaussian field <p(x,y) into Q(X, Y) 

$(X,Y) = <f>(x,y), (1) 

where 

X = x + ru(x,y), 
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Y = y + rv(x,y). (2) 

Equation 1 guarantees that every level label remains unchanged, while the level lines are 
shifted and deformed according to eqs. 2. Functions u(x,y) and v(x,y) can be any smooth 
functions including random functions. At small r before caustics have occurred, the mapping 
(x, y) — »■ (X, Y) is single valued and therefore the contour map of $(X, Y) is one-to-one image 
of (f)(x,y), although geometrically distorted. The genus of the &(X,Y) field as a function 
of the level remains exactly the same (i.e. Gaussian) as that of the <p(x,y) field because 
the mapping is continous and nonsingular and therfore preserves toplogy. On the other 
hand, the mapping (x, y) — ► (X, Y) can be neither isometric (conserving the lengths) nor 
area-preserving and therefore both the areas and the contours of the excursion sets may be 
distorted. As a result, the $(X, Y) can be a strongly non-Gausssian field having exactly the 
Gaussian genus in the level parameterization. Perhaps, it is worth noting that a particular 
case of the above example (u(x, y) = dip(x, y)/dx and v(x, y) = dip(x, y)/dy) corresponds to 
viewing the Gaussin contour map 4>(x, y) through a glass layer with varying thickness ip(x, y) 
(see e.g. Zel'dovich, Mamaev, and Shandarin (1983); Shandarin and Zel'dovich (1989)). 

A one-dimensional illustration of the above field is shown in Fig. 1. A realization of 
the Gaussian field <f>(x) is shown by the solid line and of the non-Gaussian field <$>(x) by the 
dashed line. The total area of the excursion set in 2D is analogous to the total length of 
the excursion set in ID and the genus in 2D is analogous to the number of peaks in ID. 
The level parameterization means the comparison of the number of peaks in the two fields 
at the same level. Both fields have the same number of peaks at every level as illustrated 
by the dotted and dash-dotted horizontal lines. However, the total lengths occupied by the 
peaks of the <fi(x) and <$>(x) fields are different at a given level. The length parameterization 
in ID means that the comparison of the numbers of peaks must be done at the same total 
length of the excursion sets which generally happens at different levels for the <p(x) and $(x) 
fields. For instant, the length of the excursion set of the Gaussian field at <f>(x) = 1.95 (the 
sum of two heavy line segments at level = 1.95) equals the length of the excursion set of the 
non-Gaussian field at $(x) = 1 (the sum of four heavy line segments at level = 1) . One 
can see at this total length the number of peaks in the Gaussian field is two while in the 
non-Gaussian field it is four. Thus, the number of peaks parameterized by the total length 
does detect this type of non-Gaussianity while the number of peaks parameterized by the 
level does not. Similarly, the area parameterization of the genus in two dimensions does 
detect this type of non-Gaussianity while the level parameterization of the genus statistic 
does not. 

If the mapping (eq. 2) is area-preserving than the genus of the $(X, Y) field remains ex- 
actly Gaussian also in the area parameterization. Thus, a detection of such non-Gaussianity 
requires additional tests. 



-4- 



We use two different parameterization of the measured morphological characteristics and 
show that the parameterization by the total area of the excursion set (i.e. the cumulative 
probability function) has an advantage over parameterization by the temperature because it 
gives considerably smaller correlations between different measures. In particular, the total 
area parameterization detects non-Gaussianity by the genus statistic in the above example 
while the temperature parameterization does not. For additional discussion of this issue, see 
Shandarin (2002). 

Along with the QMASK map (Xu et al. 2001), we analyze a thousand reference maps 
with the same sky coverage, noise properties and power spectrum as the QMASK map. For 
each map we compute seven morphological functions at a hundred temperature/area levels. 
These functions are: 

1. the total area of the excursion set A(AT), 

2. the total contour length C(AT) and C(A) , 

3. the genus of the excursion set G(AT) and G(A) , 

4. the area of the largest (by area) region A p (AT) and A p (A) , 

5. its perimeter C P (AT) and C P (A), 

6. genus G P (AT) and G P (A), and finally 

7. the number of regions N C (AT) and N C (A) . 

The first three are the global Minkowski functionals and the following three are the Minkowski 
functionals of the largest (by area) region. The largest region at some threshold spans the 
whole map, or "percolates" . Its Minkowski functionals are the best indicators of the perco- 
lation phase transition (Shandarin 1983; Yess and Shandarin 1996). Our notations reflect 
some of the jargon used in cluster analysis: A p , C p , G p for the Minkowski functionals of the 
percolating region and N c for the number of regions, often referred to as "clusters" in cluster 
analysis. 

We then calculate the mean for each of these fourteen functions (Mi (AT) and Mi(A), i = 
1, • • • , 7) computed from a thousand Gaussian maps and quantify the differences of every 
function for every map with respect to the mean functions in quadratic measure. Finally, 
we compute these differences for the QMASK map and test the Gaussianity hypothesis by 
comparing them to the thousand reference maps. 
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The rest of the paper is organized as follows. In Sec. 2 and 3 we briefly describe the 
QMASK map and the Gaussian simulations thereof. In Sec. 4 we define the morphological 
functions, their parameterization and the method of quantifying the departure from the 
Gaussianity. In Sec. 5 we summarize the results. 



2. QMASK Map 

The QMASK map, described in (Xu et al. 2001), combines all the information from 
the QMAP (Devlin et al. 1998; Herbig et al. 1998; de Oliveira-Costa et al. 1998) 
and Saskatoon (Netterfield et al. 1995, 1997; Tegmark et al. 1996) experiments into 
a single map at 30-40 GHz covering about 648 square degrees around the North Celestial 
Pole. The map consists of sky temperatures in 6495 sky pixels, conveniently grouped into a 
6495-dimensional vector x, with a FWHM angular resolution of 0.68°. As detailed in Xu et 
al. (2001), all the complications of the map making and deconvolution process are encoded 
in the corresponding 6495 x 6495 noise covariance matrix N. The map has a vanishing 
expectation value (x) = and a covariance matrix given by 

C = (xx*) = N + S, (3) 

where the signal covariance matrix S is given by 



^m+i) 



S ^E^( ? .'^>" 9W ^ ( 4 ) 

1=2 

the Pe are Legendre polynomials, Ce is the angular power spectrum, rj is a unit vector 
pointing towards the i th pixel, and 6 =FWHM/v8 In 2 is the rms beam width. When 
computing S in practice, we use the smooth power spectrum of (Xu, Tegmark, and de 
Oliveira-Costa 2001) that fits the observed QMASK power spectrum measurements. 

Since the raw map has a large and complicated noise component, we work with the 
Wiener-filtered version of the map in this paper, shown in Fig. 2 and defined as 

x„, = Wx, W = S(N + S)" 1 . (5) 

This approach was also taken in Park et al (2001) and Wu et al (2001). 



3. Mock Maps 

In order to quantify the statistical properties of our Minkowski functionals, we need large 
numbers of simulated QMASK maps. We therefore generate one thousand mock Gaussian 
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maps as follows. The covariance matrix C w of the Wiener-filtered map is given by 

C w = (x^) = ((Wx)(Wx) ( > = W(xx')W = WCW', (6) 

and we Cholesky-decompose it as C^ = LL*, where the matrix L lower triangular. It is 
straightforward to show that if z is a vector of independent Gaussian random variables with 
zero mean and unit variance (which is trivial to generate numerically), i.e., (zz*) = I, the 
identity matrix, then the mock maps defined x m = Lz will have a multivariate Gaussian 
distribution with (x m x^) = C w . 

4. Morphological Statistics 

In this paper we use a set of morphological statistics based on Minkowski functionals. As 
suggested by (Novikov, Feldman, and Shandarin 1999), one can use both global and partial 
Minkowski functionals in studies of CMB maps. In this study we use the global Minkowski 
functionals and those of the largest region by area (the so-called percolating region). 

4.1. Global Minkowski functionals 

Global Minkowski functionals were introduced into cosmology as quantitative measures 
of CMB anisotropy by Gott et al. (1990) although without reference to the Minkowski 
functionals and thus without stressing their unique role in differential and integral geometry. 
Mecke, Buchert, and Wagner (1994) were the first to place the studies of morphology of the 
large scale structure in the context of differential and integral geometry. In particular, they 
emphasized a powerful theorem by (Hadwiger 1957) stating that with rather broad restric- 
tions, the set of Minkowski functionals provides a complete description of the morphology 
(for further discussion see Kerscher 1999). 

Global Minkowski functionals describe the morphology of the excursion set at a chosen 
threshold: A(AT), C(AT) and G(AT) are the total area, contour length and genus of 
the excursion set, respectively. The Minkowski functionals of Gaussian fields are known 
analytically as functions of the threshold. Assuming that the field is normalized, i.e. u — 
(AT— < AT >)/cat so that < u >= 0, < u 2 >= 1, the Minkowski functionals are 

A(u) - 
C{u) - 
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G(u) = ^L Uffl q>(-^), (7) 

where R = v2/o"i is the characteristic scale of fluctuations in the field; a± is the rms of the 
first derivatives (in statistically isotropic fields both derivatives du/dx and du/dy have equal 
rms). A(u) is the fraction of the area in the excursion set and thus equals the cumulative 
probability function: A(u) = J p(u')du', where p(u) is the probability function. C(u) 
and G(u) are the length of the contour and the genus of the excursion set per unit area 
respectively. 

In this study we use somewhat different normalization and units. We measure the total 
contour length in the mesh units and we define G as the number of isolated regions minus 
the number of holes in the excursion set. 



4.2. Percolating region 

The morphology of every isolated region in the excursion set can be characterized by 
three partial Minkowski functionals : the area, boundary contour length and genus of the 
region. In order to describe the morphology of the field in more detail, Novikov, Feldman, 
and Shandarin (1999) used the distribution functions of the partial Minkowski functionals 
at several level thresholds. Bharadwaj et al. (2000) used the averaged shape parameters 
in the study of the morphology in the LCRS slices. Here we choose to utilize only the 
Minkowski functionals of the largest region (A p (AT), C P (AT), G P (AT)) in addition to the 
global Minkowski functionals . This is dictated by the relatively small size of the QMASK 
map: the maximum number of isolated regions is about 30 and therefore the distribution 
functions are very noisy. The Minkowski functionals of the largest, i.e. , percolating region 
of Gaussian fields are not known in an analytic form but can be easily measured in Monte 
Carlo simulations (Shandarin 2002). 

An additional comment may be useful. A naive idea of percolation invokes an image of 
the region connecting the oposite sides of the map. Thus, the question may arise: what does 
percolation mean in a map with complex boundaries and holes like the QMASK map (Fig. 
2). In particular, which points must be connected at the percolation threshold in a map with 
irregular boundaries? The fact is that percolation is a phase transition which takes place re- 
gardless of the shape and topology of the map (see e.g. Stauffer and Aharony (1992)). It can 
be studied and relatively easily measured in maps of arbitrary shapes including holes. The 
Minkowski functionals of the largest region are particularly useful while the direct detection 
of the connection between the oposite sides is much more unstable even in simple square 
maps (Dominik and Shandarin 1992). At percolation transition the area and perimeter of 



the largest region experience a rapid growth while the genus rapidly decreases (Shandarin 
2002). Similarly to many other statistics the percolation transition is affected mostly by the 
size of the map and its effective dimensions and much less by its shape or presence of holes. 
For example, if a three-dimensional map has the shape of a thin wedge then percolation 
becomes two-dimensional (Shandarin and Yess 1998). Likewise, a two-dimensional map 
having the shape of a narrow strip effectively percolates as a one-dimensional map. More 
accurately, if the smallest size of the map is smaller than the scale of the field then the 
percolation transition effectively reduces the dimensions. A simple visual inspection of Fig. 
2 shows that nothing of the kind of the problems discussed above is present in the QMASK 
map. The major drawback of the map is its size not the shape. 

In addition, we wish to stress that we do not compare the percolation curves measured 
for the QMASK map with the standard Gaussian curves. Instead we compare them with 
the curves measured in the reference maps having exactly same irregular shape and holes 
as the QMASK map. Therefore, the question of accuracy of reproduction of the standard 
percolation properties in two-dimensional maps becomes largely irrelevant. 



4.3. Numerical technique 

The numerical technique used for measuring the Minkowski functionals is described 
in detail in (Shandarin 2002). Here we outline the main features that may differ from 
numerical methods used by others (Coles 1988; Gott et al. 1990; Schmalzing and Buchert 
1997; Winitzki and Kosowsky 1997; Schmalzing and Gorski 1998; Park et al. 2001; Wu 
et al. 2001). 

First, we use neither Koendernik invariants (Koenderink 1984) nor Croftons's formulae 
(Crofton 1968) which are often used in other studies. A pixelized map is considered to be 
an approximation to a continuous field. This approach was used by Novikov, Feldman, and 
Shandarin (1999) but here we use a different and more refined version of the code computing 
Minkowski functionals . The major effect of this improvement is the numerical efficiency of 
the code. 

The major features of our technique are illustrated by Fig. 3. Suppose we use a 
regular square lattice and the true contour corresponding to u — u t h is a simple ellipse 
shown by the solid line in Fig. 3. The sites satisfying the threshold condition u > u t h are 
shown by filled triangles, while the sites with u < Uth are shown by empty triangles. The 
union of elementary squares (shown by heavy solid lines) placed on each site satisfying the 
threshold condition is widely used as an approximation to the region. One can easily count 
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the number of elementary squares, number of edges of the elementary squares and number 
of vertices in the union. Then by using Crofton's formulae one computes the area, perimeter 
and the Euler characteristic of the region. For instance, the area in this approximation is 
simply the sum of areas of the elementary squares. The perimeter is the number of external 
edges of the elementary squares. While the area converges to the right value as the lattice 
constant approaches zero, the perimeter does not (for a detailed discussion of this effect see 
e.g. Winitzki and Kosowsky (1997)). The perimeter of the set of the elementary squares 
obviously equals the perimeter of the rectangular region shown by the dotted line. As the 
lattice parameter approaches zero the perimeter of the region approches the perimeter of 
the large rectangle shown by the solid line which obviously is a wrong value. Assuming the 
isotropy of the map one can show that the mean length of a segment is 4/ir times larger 
than its true value (Winitzki and Kosowsky 1997) and can be corrected only statistically. 
Our method is completely free of this drawback and does not need such a correction. 

In our approach we begin with the construction of the contour points for each isolated 
region of the excursion set. Suppose that u(i, k) < Uth but u(i + 1, k) > Uth then the contour 
point lies somewhere between (i, k) and (i + l,k) sites. Its coordinates (x, y^) can be obtained 
by solving the linear interpolation equation 

/ , n u(i + l,k) — u(i,k) , , ,,, 

u th = u(i, k) + . ', ^i - x(i, k)) 8 

x{i + 1, k) — x(i, k) 

for the coordinate x. We construct the contour points by linear interpolation of the field 
between the sites in both horizontal and vertical directions. These points are marked by the 
solid circles in Fig. 3. Then, computing the perimeter we assume that the contour points 
are connected by the segments of a straight line (dashed line). The contours made by this 
method converges to the right value of the perimeter as the lattice constant approaches zero. 

The other major difference consists in allowing a direct linkage of diagonal grid sites in 
some cases depending on the values in the four neighboring sites. Consder a case when two 
diagonally adjacent pixels satisfy the threshold condition: u(i, k) > u t h and u(i+ l,k+l) > 
u th but the sites u(i + 1, k) < u t h and u(i, k + 1) < u t h- One usually chooses either consider 
the (i, k) and (i + 1, k+ 1) sites always directly linked or always not linked implying that the 
difference between the two alternatives disappears with vanishing the lattice parameter. Our 
definition of the direct linkage of two diagonally adjacent pixels is different from both. The 
diagonally adjacent pixels are directly linked if u mean = (u(i, k) + u{i + 1, k) + u(i, k + 1) + 
u(i + l,k + l))/4 > u t h and directly not linked otherwise. In the latter case they of course 
are allowed to be linked by the friend of friend procedure if they have proper friends. This 
method affects the number of regions, their sizes and the number of holes in the regions and 
therefore the values of all Minkowski functionals (including genus). The visual inspection of 
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Fig. 4 and Fig. 4 in Park et al. (2001) reveals small but noticeable differences in the genus 
curves. This treatment of the direct diagonal linkage also considerably reduces the intrinsic 
anisotropy of a rectangular grid. Since the QMASK map occupies a small part of the sky, 
we ignore the intrinsic curvature of the map when computing the areas and perimeters. 

If a part of the region boundary is the boundary of the map mask, then it does not 
contribute to the contour length of the region. Similarly, the holes in the map mask contribute 
neither into the genus nor the contour length. 

The analysis of one realization of a Gaussian random field at a hundred levels takes 
about 0.7s, 2.9s, 12.3s, and 73s for 64 2 , 128 2 , 256 2 , and 512 2 maps respectively on HP 
C240 workstation. It corresponds approximately to the t oc N pix ln(N pix ) dependence of the 
computational time on the size of the map. 



4.4. Parameterization by the level of AT. 

In addition to six Minkowski functionals , we computed the number of isolated regions 
N c (Novikov, Feldman, & Shandarin 1999). Thus, in total we compute seven morphological 
parameters 

M,(AT) = [A(AT),C(AT),G(AT),A p (AT),C p (AT),G p (AT),N c (AT)], i = 1,...,7 (9) 

at a hundred threshold levels AT for the QMASK map and the thousand reference maps. 
The six panels of Fig. 4 show six Minkowski functionals measured for the QMASK map 
(solid lines) and the mean values for a thousand Gaussian maps (heavy dashed lines). The 
thin dashed lines show the 68% and 95% intervals. 

The genus curve (left bottom panel) is in good qualitative agreement with that obtained 
by Park et al. (2001). However, it is worth noting that we use a different normalization 
and a somewhat different variable on the horizontal axis: we use the level normalized to 
the rms u = AT/o"at, while Park et al. use a more complex quantity v& related to the 
excursion set of the Gaussian field. Park et al. describe their parameterization as follows: 
"the area fraction threshold level z/4 is defined to be the temperature threshold level at 
which the corresponding isotemperature contour encloses a fraction of the survey area equal 
to that at the temperature level z/4 for a Gaussian field" (the bottom of p. 586 ). Some 
quantitative differences may be also ascribed to a different linkage scheme used by Park et 
al. (see the previous section). We wish to stress that the parameterization by v& has similar 
properties (including the noise correlation properties) with the parameterization by the area 
(A) discussed in the following section. The only difference between them is the geometrical 
interpretaion. 
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The number of isolated regions is shown in Fig. 5 for both the temperature and global 
area parameterizations. The latter is described in detail in Sec. 4.5. 

In order to quantify the degree of non-Gaussianity of the QMASK map, we define and 
compute a functional that quantifies the "distance" between two functions in the functional 
space of measured statistics. The distance is defined by the integral 

C /-oc ^ 1/2 

D\f ] = |y JiMAT)-M,(AT)] 2 d(AT)j , 

i = 1,...,°7; fc = l,...,1000 (10) 

where 

-. 1000 

M,(AT) = — ^Tm^(AT) (11) 

fc=i 
is the mean value of the statistic as a function of AT. The integral in eq. (10) is approxi- 
mated by the sum over a hundred levels of AT. We also compute the distances of the each 
morphological curve from the Gaussian mean for the QMASK map. The third column of 
Table 1 shows the percentile of the QMASK map distances from the mean for each mor- 
phological statistic parameterized by AT. The highest departure from the mean is shown 
by the A (AT) curve which is the cumulative probability function. It shows that only about 
27% of Gaussian curves are closer to the Gaussian mean than the QMASK curve. Other 
parameters are even closer to the corresponding Gaussian mean. 



4.5. Parameterization by the total area A. 

The parameterization of the morphological parameters by the level of the field u has 
serious drawbacks mentioned in Introduction. Although the parameterization by va used 
in most studies of the genus is free of these drwbacks it seems to be more natural for 
morphological studies to use the total area A (AT) = Aes(AT) / A m (where A^s(AT) is the 
area of the excursion set at AT and A m = Aes{— oo) is the total area of the map) as an 
independent parameter (Yess and Shandarin 1996). It is one of the Minkowski functionals 
measured for the map, it represents the cumulative distribution functions of the field, and in 
addition it has a very simple geometrical interpretaaiton. It also has an additional advantage 
of decorrelating different statistics as discussed bellow. The va parameterization must have 
similar properties because it is related to A by a monotonic transform (the first eq. of 7). 

The transformation from the level parameterization to the area parameterization is a 
highly nonlinear procedure illustrated in Fig. 6. The thick lines correspond to the QMASK 
map (solid) and a randomly chosen Gaussian map (dashed). The transformation from C(AT) 
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to C(A) involves the A(AT) function shown in the lower panel, which is different for each 
map. The thin solid lines show the transformation for the QMASK map and dashed lines 
for the Gaussian map. Note that in order to illustrate this transformation, we reversed the 
A axis in the C = C(A) plot. 

As an illustration of the diference between the level and area parameterization consider 
a simple example of a strong nonlinear field. Let the field w(x,y) is the exponent of a 
Gaussian field u(x,y): w = exp(aru). The global Minkowski functionals of such a field can 
be easily derived from eq. 7 
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Fig. 7 illustrates the diference between the Minkowski functionals of the Gaussian field 
u and non-Gaussian field w (with a — 1) if both are parameterized by the level. The 
percolation curves differ strongly as well. On the other hand both the perimeter and genus 
as the functions of the area remain identical for the both fields because the transformation 
simply relabels the levels but does not affect the geometry of the contours. This is an example 
of a trivial non-Gaussian field when all non-Gaussian information is stored in the one-point 
probability density function. The further discussion of such fields can be found in Shandarin 
(2002). 

Using the area parameterization, we compute seven new morphological functions for 
each map: 

Mi(A) = [AT(A),C(A),G(A),A p (A),C p (A),G p (A),N c (A)}, i = l,...,7. (13) 

Figure 8 shows the measured quantities as functions of A. Similarly to Fig. 4 six panels 
show six Minkowski functionals measured for the QMASK map (solid lines) and the mean 
values for a thousand Gaussian maps (heavy dashed lines). The thin dashed lines show the 
68% and 95% intervals. 

The difference of the QMASK curves from the Gaussian mean is quantified similarly to 
that described before by the distance between curves in the functional space: 
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i = 1,...,7; fc = l,...,1000 (14) 

where 

MJA) = 

1000 

fc=l 

There is an important difference between the two distances defined by eq. 10 and 14. Using 

dA = —p k (AT)d(AT) where p k (AT) is the estimate of the probability function derived from 

the k-th map one can rewrite eq. 14 as an integral over AT: 



1 1000 

M l {A) = —-Y,M lk {A). (15) 



D\t ] = [l [M tk (A) - M t (A)} 2 ct4.} 

■ poo \ 1/2 

J [M ik (AT) - M,(AT)] 2 Pk (AT) d(AT)\ (16) 



1/2 

1/2 



Comparing eq. (10) and (16), one notices that the distance between the curves computed 
in the area space (eq. 14) depends more on the bulk of the probability function and less on 
the tails of the probability function, since it is weighted by the probability function. This 
obviously makes the measure more robust because the measurements at extreme levels are 
subject to large fluctuations. It is worth stressing that the calculation of all morphological 
parameters M ik themselves (eq. 9 and 13) is independent of the type of parameterization 
because Mi k (A) = M{ k (AT) provided that A is measured at corresponding AT: A = A(AT). 
The only quantities that depend on the parameterization are the distances from the Gaussian 
expectation functions (eqs. 10 and 14) that are used for ranking the maps. 

The second column of Table 1 shows the percentile of the QMASK distances from the 
mean in the area parameterization. Comparing the values corresponding to different param- 
eterizations, one can easily see that the parameterization by AT suggests that the QMASK 
map is a more typical example of a Gaussian field than that using A as an independent 
parameter. The parameterization by the level suggests that the most exceptional deviation 
of the QMASK map from Gaussianity is given by the A (AT) curve, but more than 70% of 
Gaussian maps differ from the mean more than the QMASK map. Thus, the QMASK map 
looks like a very typical realization of the Gaussian field. Parameterizing by the total area, 
one may conclude that the Gaussian P(A) and N C (A) curves differ more than the QMASK 
map in only about 21% and 18% of the cases respectively. Thus, this parameterization re- 
veals that the QMASK map is not quite a typical realization of the Gaussian field, but is 
not very unusual either, quite consistent with Gaussianity. 

It is perhaps worth noting that all statistics but the genus of the percolating region, G p 
show a greater difference of the QMASK map with respect to the corresponding expectation 
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value of the set of Gaussian maps in the A parameterization than that in AT parameteriza- 
tion. Since the difference between two parameterizations for G p is not large (12% and 21%) 
it maybe a statistical fluctuation. One also may note that two columns of Table 1 appear 
anti-correlated. As we have discussed before the quantities shown in two columns of Table 1 
have different sensitivity to the measurements at extreme AT levels both highest and lowest. 
The numbers in the first column corresponding to the A parameterization are more robust. 



4.6. Cross-correlations 

The parameterization of the morphological statistics by the total area makes the set of 
parameters M^A) (eq. 13) less correlated than Mj(AT) (eq. 9). Table 2 shows the correlation 
coefficients of the distances 

1000 

Y t (D lk -D, i )(D jk -D j ) 
r H =: T^Z-^ 1 — — TTT75 ( 17 ) 



/1000 \ /1000 

E (A* - A) 2 E (D jk - A) 2 

\fc=l / \fc=l 



for all 21 pairs of morphological statistics. The values above the diagonal show the correla- 
tions when all statistics were parameterized by the total area of the excursion set, A, while 
the values below the diagonal were computed using the parameterization by the level, AT. 

The correlation coefficient shows for each pair of statistics how well the separation from 
the mean Gaussian curve in one statistic Mj can be predicted from the other one M y The 
closer Tij to unity, the less independent are corresponding statistics. In this respect the AT 
and A parameterizations are very different. For instance, the lowest correlation in the class 
of the level parameterization is 0.82 between G P (AT) and iV c (AT) distances and 16 out of 
21 cross-correlations are greater than 0.9. In the class of the area parameterization, only one 
correlation coefficient is greater than 0.82: that between G(A) and N C (A) distances (0.85). 
Seventeen out of 21 pairs correlate at a level lower than 0.5 thirteen of which at the level 
lower than 0.3. Although absence of correlation does not prove the statistical independence of 
two sets of numbers, a correlation coefficient approaching unity indicates a strong statistical 
dependence. 



5. Discussion 

We present the results of a study of Gaussianity in the QMASK map. In agreement with 
the first study of topology of the degree-scale CMB anisotropy (Park et al. 2001), which 
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was limited to the genus of QMASK, we conclude that the QMASK map is compatible with 
the assumption of Gaussian AT /T fluctuations. In this study, we used six morphological 
statistics in addition to the genus, G: the areas A and A p of the excursion set and the largest 
(i.e. percolating) region, the contour lengths or perimeters C and C p of the excursion set 
and the largest region; the genus G p of the percolating region and the number of isolated 
regions N c . According to its morphology, the QMASK map is not a very typical example of 
a Gaussian field, but not very exceptional either: about 20% of a thousand Gaussian maps 
have greater differences with at least one mean function (Table 1). 

We show that the parameterization of the morphological statistic is very important. 
A naive parameterization by the temperature threshold AT results in a highly correlated 
(almost degenerate) set of morphological characteristics: 16 out of 21 cross-correlation co- 
efficients are greater than 0.9 and all the rest are greater than 0.82 (Table 2, bellow the 
diagonal). Representing the same morphological parameters as functions of the total area 
of the excursion set A helps to decorrelate them: now 13 pairs correlate less than 0.3, four 
more correlate less than 0.5 and the rest less than 0.85 (Table 2, above the diagoanl). 

Our measurements of the deviations of the morphological functions from the Gaussian 
mean strongly rely on the bulk of the probability function (see eq. 16) and therefore are 
robust to spurious effects. 

In the A parameterization the strongest deviations of the QMASK map from the Gaus- 
sian expectation values were shown by the number of peaks, N C (A) (82% i.e. the QMASK 
map deviates from the Gaussian expectation number of peaks more than 82% of Gaussian 
maps), total contour length of the excursion set, C(A) (79%) and area of the percolating 
region, A p (A) (55%) while in the AT parameterization by the total area of the excursion 
set i.e. the cumulative probability function, A(AT) (27%), genus of the percolating region, 
G P (AT) (21%) and contour length of the percolating region, C P (AT) (20%). 

The techniques that we have described here are computationally efficient (oc 0(N P i X In N P i X )), 
and should be useful also for much larger upcoming datasets such as that from the MAP 
satellite. Another promising area for future applications is computing Minkowski function- 
al from foreground maps and topological defect simulations, thereby enabling quantitative 
limits to be placed on the presence of such structures in CMB maps. 
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Table 1. Percentage of Gaussian maps deviating less than QMASK. 



Parameterized by Area, A Parameterized by Level, AT 



Level, AT 


28.2% 




Area, A 




27.2% 


Perimeter, C 


78.8% 


6.1% 


Genus, G 


48.3% 


2.0% 


Area (Perc), A p 


54.6% 


12.1% 


Perimeter (Perc), C p 


24.3% 


19.7% 


Genus (Perc), G p 


11.6% 


21.4% 


Number of regions, N c 


82.5% 


0.4% 



-21- 



Table 2. Correlations between different non-Gaussianity statistics parametrized by 
temperature, AT (below the diagonal) and area, A (above the diagonal). 
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Fig. 1. — One-dimensional illustration of a non-Gaussian field $ (dashed line) having exactly 
same number of peaks (which is analogous to genus in 2D) in the level parameterization as 
the Gaussian field <fi (solid line). Compare the number of peaks in two fields at two levels 
marked by the dotted and dash-dotted horizontal lines. The length parameterization in ID 
is analogous to the area parameterization in 2D. In order to compare the number of peaks in 
the length parameterization one has to count the peaks of the two fields at different levels. 
The marked levels are chosen in such a way that the total length of the excursion set of the 
non-Gaussian field at $ = 1. (the sum of four heavy segments) equals the total length of the 
the excursion set of the Gaussian field at (p = 1.95 (the sum of two heavy segments). 
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Fig. 2. — Grey scale QMASK map. Light color correspond to higher temperatures. Note 
that there are three clearly distinct white regions in the map where data are absent. 
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Fig. 3. — An illustration of the numerical technique. The solid line shows the elliptical 
contour corresponding to the certain threshold u = Uth- Triangles mark the sites of the 
lattice: solid with u > Uth and empty with u < u t h- Solid circles mark the contour points 
satisfying the condition u = u t h (the roots of eq. 8); the dashed line is the resulting contour. 
The true area of the region is the area within the solid ellipse. We approximate it by the 
area within the dashed contour while in most works it is approximated by the sum of areas 
of the elementary squares (solid squares). We approximate the perimeter by the length of 
the dashed contour while in other works it is often approximated by the sum of the external 
edges of the elementary squares. One can easily see that this approximation gives the value 
of the perimeter of the large dotted rectangle. As the lattice constant approaches zero it 
converges to the perimeter of the large solid rectangle, while our perimeter converges to the 
true value of the perimeter of the solid ellipse. 
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Fig. 4. — Minkowski functionals as functions of u — AT/<7at for the excursion set (left hand 
side column) and for the percolating region (right hand side column). Top row: the area 
fraction A and A p , middle row : contour length C and C p , bottom row: genus G and G p . 
The contour lengths are given in mesh units. The genus is "number of regions" — "number 
of holes" . The solid lines show the parameters of the QMASK map, heavy dashed lines show 
the median Gaussian values, thin dashed lines show 68% and 95% ranges. 
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Fig. 5. — The number of regions N c as a function of the level u = AT/ctat (left hand panel) 
and as a function of A (right hand panel). Other notation is as in Fig. 4 
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Fig. 6. — Illustration of the transformation from AT to A parameterization. The thick 
solid lines show C = C(AT) (top left panel) C = C(A) (top right panel) and A = A(AT) 
(bottom left panel) for the QMASK map. The thin solid lines illustrate the transformation. 
The dashed lines show similar transformation for a randomly chosen Gaussian map. 
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Fig. 7. — The global Minkowski functionals of the parent Gaussian field u and derived non- 
Gaussian field w = exp u are shown in two left hand side panels as functions of the level. Both 
the perimeter and genus remain the same for both fields if they are parameterized by the 
total area (two bottom right hand side panels). All information about the non-Gaussianity 
of the w-field is stored in the cumulative probability function (dashed lines in the top panels). 
The solid lines in the top panels show the Gaussian cumulative probability function. 
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Fig. 8. — The figure is similar to Fig. 4, except that all morphological parameters are 
functions of A = A ES /A m , where A ES and A m = A ES (—oo) is the area of the excursion set 
and that of the whole map, respectively. 



